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HOW  MANY  SPECIMENS?  - 

AN  AID  IN  THE  DESIGN  OF  FATIGUE  TEST  PROGRAMS 


by 

A.S.  Machin 


SUMMARY 


Increasing  fatigue  test  sample  sizes  increases  the 
accuracy  with  which  population  statistics  may  be  calculated 
and  increases  the  sensitivity  of  tests  comparing  different 
groups  of  results,  but  it  also  increases  testing  costs. 
Information  is  presentated  which  will  aid  the  designer  of  a 
fatigue  test  program  in  deciding  sample  sizes. 
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NOTATION 


size  of  sample 

sample  standard  deviation 

weighted,  combined  standard  deviation  of  two  sample 

percentage  point  of  Student's  t-distribution  with  m 
degrees  of  freedom  and  significance  a 

sample  mean 

percentage  point  of  chi-square  distribution  with  m 
degrees  of  freedom  and  significance  a 

population  mean 

population  standard  deviation 


1. 


INTRODUCTION 


The  purpose  of  this  memorandum  is  to  aid  an  experimenter 
with  a  portion  of  the  design  of  Fatigue  Test  programs;  the  question  of 
how  many  specimens  to  test  at  each  condition.  None  of  the  information 
is  novel  or  original,  but  it  may  be  useful  to  those  who  are  experimenters 
first  and  statisticians  second.  There  is  an  extensive  literature  on 
the  subject  of  Experimental  Design  (e.g.  Refs.  1,2)  which  should  be 
studied,  in  particular,  those  sections  on  randomisation  in  experimental 
design  should  be  noted,  since  valid  statistical  inference  depends  on 
having  a  'random  sample'. 

Fatigue  testing  can  be  expensive  (Appendix  1) .  The 
overall  cost  of  a  Fatigue  Test  program  can  be  reduced  by  minimising  the 

number  of  specimens  tested,  but  it  is  essential  to  test  enough  specimens 

* 

so  that  reliable  statistical  inferences  may  be  extracted  from  the 
results.  Given  that  resources  are  often  limited  it  may  be  necessary 
to  reduce  the  scope  of  a  test  program  so  that  a  restricted  amount  of 
reliable  information  is  obtained  rather  than  a  quantity  of  potentially 
inconclusive  data. 

2 .  STATISTICAL  ASSUMPTIONS 


In  the  analysis  of  fatigue  test  data,  a  common  practice 
is  to  assume  that  the  logarithms  (to  base  10)  of  the  fatigue  lives  of 
specimens  are  members  of  a  population  belonging  to  a  normal  distribution. 
Other  theoretical  frequency  distributions  are  used  (e.g.  the  various 
extreme-value  distributions)  but  the  information  presented  in  this 
Memorandum  is  derived  from  analysis  of  the  normal  distribution.  For 
distributions  which  are  not  quite  normal,  the  results  presented  are 
adequate  for  most  practical  purposes  (Ref.  3). 

The  sample  statistics  first  calculated  when  analysing 
fatigue  test  data  are  the  standard  deviation  (s)  and  the  mean  (x),  which 
is  usually  reported  as  antilog  (x) .  Once  these  have  been  calculated, 

*  Methods  of  statistical  inference  may  be  complex,  in  this  Memorandum 
only  three  basic  techniques  will  be  examined. 


much  fatigue  analysis  involves  comparing  different  groups  of  data 
(using  a  t-test)  to  see  whether  differences  between  sample  means 
are  significant. 


The  following  sections  show  how  the  accuracy  of  estimation 
of  population  parameters  such  as  the  mean  and  standard  deviation  depend 
on  the  size  of  the  sample  and  also  how  sample  sizes  influence  the 
sensitivity  of  a  t-test. 


3.  INTERVAL  ESTIMATION  OF  MEANS 


If  x  and  s  are  the  mean  and  standard  deviation  of  a 

sample  of  size  n  taken  from  a  normal  distribution  N(u,o2)  where  y  and 

* 

o  are  unknown  ,  then: 


S  ^  -  S 

x-t  ,  ^  y  x  +  t  ..  — 

n“1,a/2  Jii  n-l,a/2  ^ 


defines  a 


100(l-a)%  confidence  interval  for  y  (Ref.  3). 


Relation  (1)  may  be  modified  to  the  form: 

I  “  - J  I  :  Vi.o/2 1 

Since  it  is  the  logarithms  of  specimens  lives  which 
are  assumed  to  belong  to  a  normal  distribution,  it  follows  that 


re  assi 

Jv-x  | 


10  will  give  the  ratio  between  the  antilogs  of  the  population 

mean  y  and  the  sample  mean,  x. 

This  information  is  presented  in  Figures  1  to  7  for  sample 
standard  deviations  of  0.04,  0.06,  0.08,  0.10,  0.12,  0.16  and  0.20 
respectively.  In  each  Figure  information  is  given  for  confidence 
intervals  of  80%,  90%,  95%  and  99%+. 


*  In  fatigue  test  data  analysis  it  is  most  unlikely  that  the  population 

standard  deviation  will  be  known.  If  it  is  known,  the  interval  estimation 
of  the  mean,  with  a  small  sample  size,  will  be  more  precise. 

+  For  all  Figures  in  this  Memorandum  the  values  of  the  relationships 

presented  are  calculated  only  at  integer  sample  sizes  and  related  points 
are  connected  by  straight  line  segments.  Do  not  interpolate  between 
calculated  points. 
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Figures  1  to  7  show  that  errors  in  estimation  of  the 

population  mean,  y,  (by  using  the  sample  mean  x)  can  be  large.  For 

* 

a  sample  standard  deviation  of  0.10  ;  then,  if  the  sample  has  three 
members,  there  is  a  1  in  5  chance  of  the  estimate  of  y  being  in  error 
by  at  least  28%,  and  a  1  in  20  chance  of  the  estimate  being  in  error 
by  77%  (Fig.  4) .  Even  with  a  large  sample  (by  fatigue  testing  practice) 
there  is  a  limit  to  the  accuracy  of  estimation  of  y.  With  11  test 
results  in  a  sample  of  standard  deviation  0.10,  there  is  a  1  in  5 
chance  of  the  estimate  of  y  being  more  than  10%  in  error  and  a  1  in 
20  chance  of  the  estimate  being  more  than  17%  in  error. 

The  data  presented  in  Figures  1  to  7  also  show  that 
improvements  in  accuracy  of  estimation  of  the  population  mean  are 
dramatic  between  a  sample  size  of  three  and  a  sample  size  of  five  or 
six  but  that  increasing  the  sample  size  further  yields  much  smaller 
improvements.  Using  again  a  san^le  of  standard  deviation  0.10;  if 
there  are  three  results  in  the  sample  there  is  a  1  in  20  chance  of 
the  estimate  of  the  mean  being  more  than  77%  in  error.  If  the  testing 
cost  is  increased  by  doubling  the  sample  size  and  six  results  obtained 
there  is  a  1  in  20  chance  of  the  estimate  of  population  mean  being 
more  than  27%  in  error.  If  the  testing  cost  is  again  increased  and 
the  sample  consists  of  11  results  there  is  a  1  in  20  chance  of  the 
estimate  of  the  population  mean  being  more  than  17%  in  error. 

4.  INTERVAL  ESTIMATION  OF  STANDARD  DEVIATION 


If  s  is  the  standard  deviation  of  a  sample  of  size  n  from 
a  normal  distribution  (y,oa)  then: 


n-1 


n-1, a/2 


£ 

s 


n-1 


*n-l,l-a/2 


(3) 


defines  a  100(l-a)%  confidence  interval  for  c  (Ref.  3). 


Standard  deviations  of  about  this  magnitude  are  common  if  the  testing 
is  done  in  an  electro-hydraulic  fatigue  testing  machine  (e.g.  the 
results  presented  in  Ref.  4)  but  the  data  presented  in  Ref.  5  suggest 
that  larger  standard  deviations  mav  expected. 
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Figure  8  presents  the  relation  (3)  for  confidence  intervals 
of  80%,  90%,  95%  and  99%.  This  Figure  shows  the  difficulty  of  making 
an  accurate  estimate  of  a  population  standard  deviation,  even  with  (by 
fatigue  test  practice)  quite  large  sample  sizes.  For  each  confidence 
interval  the  degree  by  which  the  population  standard  deviation  may  be 
under-estimated  (by  using  s)  is  greater  than  the  degree  to  which  it 
may  be  over-estimated. 


As  with  estimation  of  population  means,  a  sample  size 
of  five  or  six  gives  an  estimate  of  population  standard  deviation  which 
approaches  the  accuracy  obtainable  with  a  sample  size  of  11  and  is  much 
better  than  the  accuracy  obtained  with  a  sample  size  of  three. 

5.  THE  t-TEST 


This  test  may  be  used  to  establish  whether  there  is  a 

* 

significant  difference  between  two  sample  means  .  If  the  means  of  the 
parent  populations  are  and  respectively  and  the  hypothesis  is 
made  that  =  p2»  then  this  hypothesis  may  be  tested  by  the  relation 
(Ref.  3) : 


+  n. 


2,  a/2 


h  :  -2 _ <t 

S  (1/n  +  1/n  )  $  n.+n  -2,  a/2 
c  1  2  12 


(4) 


where  S  is  the  combined  sample  standard  deviation  and  is  given  by: 
c 


.  /'"i-vy  +  <",-1>VA  t 

>C  "  V  n!  +  ”2  -  2  J 


and  where  a  is  the  probability  of  rejecting  the  hypothesis  when  it  is 
correct+. 


*  Sample  variances  should  be  compared  first,  using  an  F-test,  to  check 
that  population  variances  are  not  significantly  different. 

+  Rejecting  a  correct  hypothesis  is  a  Type  1  error.  Users  of  the 
t-test  should  bear  in  mind  the  possibility  of  Type  II  errors, 
i.e.,  accepting  as  correct  a  hypothesis  that  is  incorrect.  The  level 
of  Type  II  errors  depends  on  the  deviation  between  the  two  populations, 
which  in  practice  can  not  be  determined. 


falls  outside  the  limits  given  in  (4) 


v--  ooo 


-5- 


If 


x^  -  x„ 


S  (1/n  + 1/n  ) 7 
cl  2 

above,  then  the  hypothesis  that  p  =  is  rejected  and  there  is  a 
significant  difference  between  the  sample  means. 


The  relation  (4)  may  be  re-arranged  to: 


5l-52  !  <tn1  +  n2-2,a/2  Sc  (1/n!  +  ^ 


(5) 


!v,_; 


Since  logarithms  of  specimen  lives  are  used,  it  follows 


that  lO1*1  X2 1  will  give  the  ratios  between  the  antilogs  of  the  two 
samole  means. 


Relation  (5)  is  presented  in  Figures  9,  10,  11  for 
confidence  levels  of  10%,  5%  and  1%  respectively  and  for  combined 
sample  standard  deviations  of  0.06,  0.10,  0.12  and  0.16.  The  simpli¬ 
fying  condition  that  n^  =  n 2  has  been  made. 

These  Figures  show  the  effect  on  the  sensitivity  of  a 
t-test  of  the  sample  size  and  combined  sample  standard  deviation,  and 
enable  an  estimate  to  be  made  of  the  differences  in  mean  life  that  must 
exist  between  samples  before  the  t-test  will  show  them  to  be  significantly 
different.  Using  Fig.  10  (a  =  0.05)  as  an  example;  if  the  combined 
sanqsle  deviation  is  0.10  then,  for  samples  of  six  results  each,  a  life 
difference  of  20%  will  not  be  significant  but  a  life  difference  of  25% 
will  be  significant. 

Even  with  large  sample  sizes,  the  differences  in  mean 
lives  have  to  be  considerable  before  it  can  be  shown  that  they  are 
significant.  For  example,  for  a  *  0.05,  a  combined  sample  standard 
deviation  of  0.10  (Fig.  10)  and  two  samples  of  11  results  each,  the  mean 
lives  of  the  samples  must  differ  by  more  than  16%  for  them  to  be  sign¬ 
ificantly  different. 


Samples  of  five  or  six  results  each  should  provide  a 
good  compromise  between  fatigue  test  costs  and  t-test  sensitivity. 


6. 


CONCLUDING  REMARKS 


In  a  fatigue  test  program  sample  means  or  standard 
deviations  cannot  be  nominated  before  testing  (though  a  literature 
search  of  the  results  of  similar  tests  may  give  a  guide  to  the  expected 
magnitude  of  standard  deviations) .  This  being  the  case  it  is  impossible 
to  examine  the  Figures  in  this  Memorandum  and  so  decide  in  advance  the 
numbers  of  specimens  to  be  tested  to  provide  a  desired  accuracy  of 
statistical  inference. 

The  information  presented  here  should  provide  the  reader 
with  some  appreciation  of  the  limits  of  accuracy  of  estimation  of 
population  parameters  when  using  the  small  sample  sizes  that  are  usually 
forced  by  constraints  on  test  program  cost  and  time.  It  has  been  shown 
that,  accuracy  improves  dramatically  as  the  sample  size  is  increased  to 
five  or  six  results,  but  improvement  is  less  when  sample  sizes  are 
increased  further.  It  is  up  to  the  experimenter  to  decide  when  the 
costs  of  increasing  sample  size  outweight  the  benefits  of  improved 
accuracy. 
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APPENDIX  I 

COST  OF  TESTING  SPECIMENS 

A  test  program  designed  to  improve  the  fatigue  life 
of  the  main  spar  of  the  Mirage  IIIO  wing  spar  was  recently  completed 
by  Structures  Division  of  Aeronautical  Research  Laboratories.  During 
this  program  246  moderately  complex  specimens  were  tested  in  about 
2.5  years,  i.e.  a  testing  rate  of  100  specimens  per  year.  A  very 
simple  estimate  of  the  direct  cost  is  given  below. 


COSTS 

(1) 

Material 

Supplied  by  RAAF, 
no  charge  to  A.R.L. 

(2) 

Machining  of 
specimen  blanks 

$250  per 

specimen 

(3) 

Cost  of  using 
testing  machine 

(a) 

$200  per 

specimen 

(4) 

Salaries 

(b) 

$450  per 

specimen 

TOTAL 

$900  per 

specimen 

(a)  Assuming  a  machine  life  of  10  years  and  a  capital  cost  of  $100,000 
(1983  dollars)  for  an  electrchydraulic  fatigue  testing  machine  and 
assuming  maintenance  and  running  costs  of  $100,000  over  10  years, 
then  cost  per  year  will  be  $20,000.  If  100  specimens  are  tested 
per  year,  the  cost  per  specimen  will  be  $200. 

(b)  Assuming  direct  salary  costs  of  $45,000  per  year,  i.e.  approximately 
one  sub-professional  and  one  professional  officer.  : 

In  addition  to  the  costs  above,  various  items  of  equipment 
were  purchased  OS.  part  of  the  program  and  many  man-months  were  spent 
writing  the  final  Report. 


CONFIDENCE  INTERVALS  FOR  MEANS  OF  SAMPLES  WITH  A  STANDARD 
DEVIATION  OF  0.06.  POPULATION  STANDARD  DEVIATION  UNKOWN. 


Sample  standard  deviation 
Confidence  intervals  of: 


CONFIDENCE  INTERVALS  FOR  MEANS  OF  SAMPLES  WITH  A  STANDARD 
DEVIATION  OF  0.08.  POPULATION  STANDARD  DEVIATION  UNKNOWN. 
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EFFECT  OF  SAMPLE  SIZE  ON  t-TEST  SENSITIVITY 
AT  THE  10V.  LEVEL  OF  SIGNIFICANCE. 


EFFECT  OF  SAMPLE  SIZE  ON  t -TEST  SENSITIVITY 
AT  THE  5%  LEVEL  OF  SIGNIFICANCE. 


EFFECT  OF  SAMPLE  SIZE  ON  t -TEST  SENSITIVITY 
AT  THE  1%  LEVEL  OF  SIGNIFICANCE. 
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